Data Mining Opportunities In Very Large
نویسنده
چکیده
Information overload can hamper extracting hidden knowledge from a database. Data mining techniques ooer automated exploratory data analysis of databases. The mining process reveals knowledge buried in the data and provides insights into these data. Conventional mining techniques such as neural nets, regression analysis, and the discovering of rules uncover hidden information from user-provided training examples. A user indicates several of objects which are of interest to him (positive training examples) and few objects which are of no interest to him (negative training examples). The system then automatically discovers all interesting objects. In this position paper we present data mining opportunities that arise in very large object oriented databases, wherein the users no longer need to provide training examples. Data mining is achieved by user set priorities, user speciied guessed queries, and the notion of complete known information (i.e. information closure) about an object. From these inputs, hidden knowledge buried among huge amounts of information can be extracted in a user-driven manner.
منابع مشابه
Statistical Themes and Lessons for Data
Data mining is on the interface of Computer Science and Statistics, utilizing advances in both disciplines to make progress in extractinginformationfrom large databases. It is an emerging eld that has attracted much attention in a very short period of time. This article highlights some statistical themes and lessons that are directly relevant to data mining and attempts to identify opportunitie...
متن کاملA Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis
Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...
متن کاملA Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis
Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...
متن کاملComputational Challenges in Data Mining
Data mining is applied in business to find new market opportunities from data stored in operational data bases which are used for dayto-day management. The tools applied combine ideas from statistics, machine learning, data base technology and high performance computing to find nuggets of knowledge. Data mining is also applied in science for example to find taxonomies of variable stars and in t...
متن کاملNew Opportunities in Marketing Data Mining
INTRODUCTION Data mining has been widely applied in many areas over the past two decades. In marketing, many firms collect large amount of customer data to understand their needs and predict their future behavior. This chapter discusses some of the key data mining problems in marketing and provides solutions and research opportunities.
متن کامل